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AAGTAAAAGAAAGAGCGAGAAATCATCGAAATGGATTTCATCTCATCTCTTATCGTTGGC 
1 + + + + ^ ^ go 

TTCATTTTCTTTCTCGCTCTTTAGTAGCTTTACCTAAAGTAGAGTAGAGAATAGCAACCG 

a K*KKEREIIE(g)DFISSLIVG 

b SKRKSEKSSKWISSHLLSLA- 

C VKERARNHRNGFH LISYRWL- 

TGTGCTCAGGTGTTGTGTGAATCTATGAATATGGCGGAGAGAAGAGGACATAAGACTGAT 
61 + + -+ + + + 120 

ACACGAGTCCACAACACACTTAGATACTTATACCGCCTCTCTTCTCCTGTATTCTGACTA 

a CAQVLCESMNMAERRGHK'TD 

b VLRCCVNL*IWRREEDIRLI- 

C CSGVV*IYEYGGEKRT*D*S- 

CTTAGACAAGCCATCACTGATCTTGAAACAGCCATCGGTGACTTGAAGGCCATACGTGAT ^ 

121 + + + + + + 180 

GAATCTGTTCGGTAGTGACTAGAACTTTGTCGGTAGCCACTGAACTTCCGGTATGCACTA 

a L. RQAITDLETAIGDLKAIRD 

b LDKPSLILKQ PSVT*RPYVM- 

c *TSHH*S*NSHR*LEGHT**- 

GACCTGACTTTACGGATCCAACAAGACGGTCTAGAGGGACGAAGCTGCTCAAATCGTGCC 

181 + + + + + + 240 

CTGGACTGAAATGCCTAGGTTGTTCTGCCAGATCTCCCTGCTTCGACGAGTTTAGCACGG 

a DLTLRIQQDGLEGRSCSNRA 

b T*LYGSNKTV*RDEAAQIVP- 

c PDFTDPTRRSRGTKLLKSCQ- 

AGAGAGTGGCTTAGTGCGGTGCAAGTAACGGAGACTAAAACAGCCCTACTTTTAGTGAGG 
241 + + + + + + 300 

TCTCTCACCGAATCACGCCACGTTCATTGCCTCTGATTTTGTCGGGATGAAAATCACTCC 

a REWLSAVQVTETKTALL LVR 

b ESGLVRCK*RRLKQPYF**G- 

c RVA*CGASNGD*NSPTFSEV- 

TTTAGGCGTCGGGAACAGAGGACGCGAATGAGGAGGAGATACCTCAGTTGTTTCGGTTGT 

301 + + + + + + 360 

AAATCCGCAGCCCTTGTCTCCTGCGCTTACTCCTCCTCTATGGAGTCAACAAAGCCAACA 

a FRRREQRTR MRRRYLSCFGC 

b LGVGNRGRE*GGDTSVVSVV- 

c *ASGTEDANEEEIPQLFRLC- 

GCCGACTACAAACTGTGCAAGAAGGTTTCTGCCATATTGAAGAGCATTGGTGAGCTGAGA 

361 + + + + + + 420 

CGGCTGATGTTTGACACGTTCTTCCAAAGACGGTATAACTTCTCGTAACCACTCGACTCT 



Fig. 2A 
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a ADYKLCKKVSAILKSIGELR 

b PTTNCARRFLPY*RALVS*E- 

c RLQTVQEGFCHIEEH W*AER- 

GAACGCTCTGAAGCTATCAAAACAGATGGCGGGTCAATTCAAGTAACTTGTAGAGAGATA 
421 — -+ + + + + + 

CTTGCGAGACTTCGATAGTTTTGTCTACCGCCCAGTTAAGTTCATTGAACATCTCTCTAT 

a ERSEAIKTDGGSIQVTCREI 

b NALKLSKQMAGQFK*LVERY- 

C TL*SYQNRWRVNSSNL*RDT- 

CCCATCAAGTCCGTTGTCGGAAATACCACGATGATGGAACAGGTTTTGGAATTTCTCAGT 
481 + + + + + + 

GGGTAGTTCAGGCAACAGCCTTTATGGTGCTACTACCTTGTCCAAAACCTTAAAGAGTCA 

a PIKS VVGNTTMMEQVLEFLS 

b PSSPLSEIPR* WNRFW NFSV- 

C HQVRCRKYHDDGTGFGISQ*- 

GAAGAAGAAGAAAGAGGAATCATTGGTGTTTATGGACCTGGTGGGGTTGGGAAGACAACG 
541 + + + + + gQO 

CTTCTTCTTCTTTCTCCTTAGTAACCACAAATACCTGGACCACCCCAACCCTTCTGTTGC 

a EEEERGIIGVY GP GGVGKTT 

b KKKKEESLVFMDLVGLGRQR- 

C RRRKRNHWCL WTWWGWEDNV- 

TTAATGCAGAGCATTAACAACGAGCTGATCACAAAAGGACATCAGTATGATGTACTGATT 
601 + + + + + + 660 

AATTACGTCTCGTAATTGTTGCTCGACTAGTGTTTTCCTGTAGTCATACTACATGACTAA 

a LMQSINNELITKGHQYDVLI 

b *CRALTTS*SQKDISMMY*F- 

c NAEH*QRADHKRTSV*CTDL- 

TGGGTTCAAATGTCCAGAGAATTCGGCGAGTGTACAATTCAGCAAGCCGTTGGAGCACGG 

661 . + + + + + + 720 

ACCCAA6TTTACAGGTCTCTTAAGCCGCTCACATGTTAAGTCGTTCGGCAACCTCGTGCC 

WVQMSREFGECTIQQAVGAR 
GFKCPENSASVQFSKPLEHG- 
GSNVQRIRRVYNSASRWSTV- 

TTGGGTTTATCTTGGGACGAGAAGGAGACCGGCGAAAACAGAGCTTTGAAGATATACAGA 
+ ,^ ^ ^ ^ 780 

AACCCAAATAGAACCCTGCTCTTCCTCTGGCCGCTTTTGTCTCGAAACTTCTATATGTCT 

LGLSWDEKETGENRALKIYR 
WVYLGTRRRPAKTEL*RYTE- 
GFILGREGDRRKQSFEDIQS- 

GCTTTGAGACAGAAACGTTTCTTGTTGTTGCTAGATGATGTCTGGGAAGAGATAGACTTG 
+ ^ + ^ ^ ^ 840 

CGAAACTCTGTCTTTGCAAAGAACAACAACGATCTACTACAGACCCTTCTCTATCTGAAC 

Fig. 2B 
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a ALRQ RFLLLLDDVWEEIDL 

b L*DP rVSCCC*MMSGKR*TW- 

c FET TFLVVAR*CLGRDRLG- 

GAGAAAACTGGAGTTCCTCGACCTGACAGGGAAAACAAATGCAAGGTGATGTTCACGACA 
841 + — + — _+ + 

CTCTTTTGACCTCAAGGAGCTGGACTGTCCCTTTTGTTTACGTTCCACTACAAGTGCTGT 

a EKTGVPR PDRENKCKVMFTT 

b RKLEFLDLTGKTNAR*CSRH- 

C ENWSSST*QGKQMQGDVHDT- 

CGGTCTATAGCATTATGCAACAATATGGGTGCGGAATACAAGTTGAGAGTGGAGTTTCTG 
901 + + + + + + 960 

GCCAGATATCGTAATACGTTGTTATACCCACGCCTTATGTTCAACTCTCACCTCAAAGAC 

a RSIALCN NMGAEYKLRVEFL 

b GL* HYATIWVRNTS*EWSFW-- 

c VYSIMQQYGCGIQVESGVSG" 

GAGAAGAAACACGCGTGGGAGCTGTTCTGTAGTAAGGTATiSGAGAAAAGATCTTTTAGAG 
961 + + + + + + 1020 

CTCTTCTTTGTGCGCACCCTCGACAAGACATCATTCCATACCTCTTTTCTAGAAAATCTC 

a E KKHAWELFC SKVWRKDLLE 

b RRNTRGSCSVVRYGEKIF*S- 

c EETRVG AVL**GMEKRSFRV- 

TCATCATCAATTCGCCGGCTCGCGGAGATTATAGTGAGTAAATGTGGAGGATTGCCACTA 

1021 + + + + + + 1080 

AGTAGTAGTTAAGCGGCCGAGCGCCTCTAATATCACTCATTTACACCTCCTAACGGTGAT 

a SSSIRRLAEIIVSKCGGLPL 

b HHQFAGSRRL**VNVEDCH*- 

c IINSPARGDYSE*MWRIATS- 

GCGTTGATCACTTTAGGAGGAGCCATGGCTCATAGAGAGACAGAAGAAGAGTGGATCCAT 

1081 + + + + + + 1140 

CGCAACTAGTGAAATCCTCCTCGGTACCGAGTATCTCTCTGTCTTCTTCTCACCTAGGTA 

a ALITLGGAMAHRETEEEWIH 

b R *SL*EEPWLIERQKKSGSM- 

C VDHFRRSHGS*RDRRRVDPC- 

GCTAGTGAAGTTCTGACTAGATTTCCAGCAGAGATGAAGGGTATGAACTATGTATTTGCC 

1141 + + + + + 1200 

CGATCACTTCAAGACTGATCTAAAGGTCGTCTCTACTTCCCATACTTGATACATAAACGG 

a ASEVLTRFPAEMKGMNYVFA 

b LVKF*LDFQQR*RV*TMYLP- 

c **SSD*ISSRDEGYELCICP- 

CTTTTGAAATTCAGCTACGACAACCTCGAGAGTGATCTGCTTCGGTCTTGTTTCTTGTAC 

1201 + + + + + + 1260 

GAAAACTTTAAGTCGATGCTGTTGGAGCTCTCACTAGACGAAGCCAGAACAAAGAACATG 

Fig. 2C 
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a LLKFSY DNLESDLLRSCFLY 

b F*NSATTTSRVICFGLVSCT- 

c FEIQLRQPRE*SASVLFLVL- 

TGCGCTTTATTCCCAGAAGAACATTCTATAGAGATCGAGCAGCTTGTTGAGTACTGGGTC 

1261 + + + + + + 1320 

ACGCGAAATAAGGGTCTTCTTGTAAGATATCTCTAGCTCGTCGAACAACTCATGACCCAG 

a CALFPEEHS IEIEQLVEYWV 

b ALYS.QK NIL*RSSSLLSTGS- 

C RFIPRRTFYRDRAAC*VLGR- 

GGCGAAGGGTTTCTCACCAGCTCCCATGGCGTTAACACCATTTACAAGGGATATTTTCTC 

1321 + + + + + + 1380 

CCGCTTCCCAAAGAGTGGTCGAGGGTACCGCAATTGTGGTAAATGTTCCCTATAAAAGAG 

a GEGFLTSSHGVNTIYKGYFL 

b AKGFSPAPMALTPFTRDIFS- 

C R RVSHQLPWR*HHLQ GIFSH- 

ATTGGGGATCTGAAAGCGGCATGTTTGTTGGAAACCGGAGATGAGAAAACACAGGTGAAG 

1381 + + + + + 1440 

TAACCCCTAGACTTTCGCCGTACAAACAACCTTTGGCCTCTACTCTTTTGTGTCCACTTC 

a IGDLKAACLLETGDEKTQVK 

b LGI*KRHVCWKPEMRKHR*R- 

c WGSESGMFVGNRR*E NTGED- 

ATGCATAATGTGGTCAGAAGCTTTGCATTGTGGATGGCATCTGAACAGGGGACTTATAAG 

1441 + -+ + + + + 1500 

TACGTATTACACCAGTCTTCGAAACGTAACACCTACCGTAGACTTGTCCCCTGAATATTC 

a MHNVVRSFALWMASEQGTYK 

b CIMWS EALHCGWHLNRGLIR- 

c A*CGQKLCIVDGI*TGDL*G- 

GAGCTGATCCTAGTTGAGCCTAGCATGGGACATACTGAAGCTCCTAAAGCAGAAAACTGG 

1501 + + + + + + 1560 

CTCGACTAGGATCAACTCGGATCGTACCCTGTATGACTTCGAGGATTTCGTCTTTTGACC 

a ELILVEPSMGHTEAPKAENW 

b S*S*LSLAWDILKLLKQKTG- 

c ADPS*A*HGTY*SS*SRKLA- 

CGACAAGCGTTGGTGATCTCATTGTTAGATAACAGAATCCAGACCTTGCCTGAAAAACTC 

1561 + + + + + + 1620 

GCTGTTCGCAACCACTAGAGTAACAATCTATTGTCTTAGGTCTGGAACGGACTTTTTGAG 

a RQALVISLLDNRIQTLPEK L 

b DKRW*SHC*ITESRPCLKNS- 

C TSVGDLIVR*QNPDLA*KTH- 



Fig. 2D 
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ATATGCCCGAAACTGACAACACTGATGCTCCAACAGAACAGCTCTTTGAAGAAGATTCCA 

1621 + + + + + + 1680 

TATACGGGCTTTGACTGTTGTGACTACGAGGTTGTCTTGTCGAGAAACTTCTTCTAAGGT 

a ICPKLTTLMLQQNSSLKKIP 

b YARN*QH*CSNRTAL*RRFQ- 

c MPETDNTDAPTEQLFEEDSN- 

ACAGGGTTTTTCATGCATATGCCTGTTCTCAGAGTCTTGGACTTGTCGTTCACAAGTATC 

1681 + +- — + + + + 1740 

TGTCCCAAAAAGTACGTATACGGACAAGAGTCTCAGAACCTGAACAGCAAGTGTTCATAG 

a T6FFMHMPVLRVLDLSFTSI 

b QGFSCICLFSESWTCRSQVS- 

c RVFHAYACSQSLGLVVHKYH- 

ACTGAGATTCCGTTGTCTATCAAGTATTTGGTGGAGTTGTATCATCTGTCTATGTGAGGA 

1741 + + + + + + 1800 

TGACTCTAAGGCAACAGATAGTTCATAAACCACCTCAACATAGTAGACAGATACAGTCCT 

a TEIPLS IKYLVELYHLSMSG 

b LRF RCLSSIWWSCIICLCQE- 

c *DSVV YQVFGGVVSSVYVRN- 

ACAAAGATAAGTGTATTGCCACAGGAGCTTGGGAATCTTAGAAAACTGAAGCATCTGGAC 

1801 + -+ + + + + I860 

TGTTTCTATTCACATAACGGTGTCCTCGAACCCTTAGAATCTTTTGACTTCGTAGACCTG 

a TKISVLPQELGNLRKLKHLD 

b QR*VYCHRSLGILEN*SIWT- 

c KDKCIATGAWES*KTEAS GP- 

CTACAAA6AACTCAGTTTCTTCAGACGATCCCACGA6ATGCCATATGTTGGCTGAGCAAG 

1861 + + + + + + 1920 

GATGTTTCTTGAGTCAAAGAAGTCTGCTAGGGTGCTCTACGGTATACAACCGACTCGTTC 

a LQRTQFLQTIPRDAICWLSK 

b YKELSFFRR SHEMPYVG*AS- 

c. TKNSVSSDDPTRCHMLAEQA- 

CTCGA6GTTCTGAACTTGTACTACAGTTACGCCGGTTGGGAACTGCAGAGCTTTGGAGAA 

1921 + + + + + + 1980 

GAGCTCCAAGACTTGAACATGATGTCAATGCGGCCAACCCTTGACGTCTCGAAACCTCTT 

a LEVLNLYYSYAGWELQ SFGE 

b SRF*TCTTVTPVGNCRALEK- 

c RGSELVLQLRRLGTAELWRR- 

GATGAAGCAGAAGAACTCGGATTCGCTGACTTGGAATACTTGGAAAACCTAACCACACTC 

1981 + + + + + + 2040 

CTACTTCGTCTTCTTGAGCCTAAGCGACTGAACCTTATGAACCTTTTGGATTGGTGTGAG 



Fig. 2E 
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a DEAEELGF. ADLEYLENLTTL 

b MKQKNSDSLTWNTWKT*PHS- 

C *SRRTRIR*LGILGKPNHTR- 

GGTATCACTGTTCTCTCATTGGAGACCCTAAAAACTCTCTTCGAGTTCGGTGCTTTGCAT 

2041 + + -+ + + + 2100 

CCATAGTGACAAGAGAGTAACCTCTGGGATTTTTGAGAGAAGCTCAAGCCACGAAACGTA 

a GITVLSLETLKTLFEFGALH 

b VSLFSHWRP*KLSSSSVLCI- 

C YHCSLIGDPKNSLRVRCFA*- 

AAACATATACAGCATCTCCACGTTGAAGAGTGCAATGAACTCCTCTACTTCAATCTCCCA 

2101 + + + + + + 2160 

TTTGTATATGTCGTAGAGGTGCAACTTCTCACGTTACTT6AGGAGATGAAGTTAGAGG6T 

a KHIQHLHVEECNELLYFNLP 

b NIYSISTLKSAMNSSTSISH- 

c TYTASPR*RVQ*TPLLQSPI- 

TCACTCACTAACCATGGCAGGAACCTGAGAAGACTTAGCATTAAAAGTTGCCATGACTTG 

2161 + + + + + + 2220 

AGTGAGTGATTGGTACC6TCCTTGGACTCTTCTGAATCGTAATTTTCAACGGTACTGAAC 

a SLTN HGRNLRRLSIKSCHDL 

b HSLTM AGT*EDLALKVAMTW- 

C TH*PWQEPEKT*H*KLP*LG- 

GAGTACCTGGTCACACCCGCA6ATTTTGAAAATGATTGGCTTCCGAGTCTAGAGGTTCTG 

2221 + + + + +— + 2280 

CTCAT66ACCAGTGTGGGCGTCTAAAACTTTTACTAACC6AAGGCTCAGATCTCCAAGAC 

a EYLVTPADFENDWLPSLEVL 

b STWSHPQILKMIGFRV* RF*- 

c VPGHTRRF*K*LASESRGSD- 

AC6TTACACAGCCTTCACAACTTAACCAGAGTGTGGGGAAATTCTGTAAGCCAA6ATTGT 

2281 + + + + + + 2340 

TGCAATGTGTCGGAAGTGTTGAATTGGTCTCACACCCCTTTAAGACATTCGGTTCTAACA 

a TLHSLHNLTRVWGNSVSQDC 

b RYTAFTT*PECGEIL*AKIV- 

c VTQPSQLNQSVGKFCKPRLS- 

CTGCGGAATATCCGTTGCATAAACATTTCACACTGCAACAAGCT6AAGAATGTCTCATGG 

2341 + + + + + 2400 

GACGCCTTATAGGCAACGTATTTGTAAAGTGTGACGTTGTTCGACTTCTTACAGAGTACC 

a LRNIRCINISHCNKLKNVSW 

b CGISVA*TFHTATS*RMSH G- 

C AEYPLHKHFTLQQAEECLMG- 

GTTCAGAAACTCCCAAAGCTAGAGGTGATTGAACTGTTCGACTGCAGAGAGATAGAGGAA 

2401 + + + + + + 2460 

CAAGTCTTTGAGGGTTTCGATCTCCACTAACTTGACAAGCTGACGTCTCTCTATCTCCTT 

Fig. 2F 
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a VQKLPKLEVIELFDCREIEE 

b FRNSQS*R*LNCSTAER*RN- 

c SETPKARGD*TVRLQRDRGI- 

TTGATAAGCGAACACGAGAGTCCATCCGTCGAAGATCCAACATTGTTCCCAAGCCTGAAG 

2461 + + + + + + 2520 

AACTATTCGCTTGTGCTCTCAGGTAGGCAGCTTCTAGGTTGTAACAAGGGTTCGGACTTC 

a LISEHESPSVEDPTLFPSLK 

b **ANTRVHPSKIQHCSQA*R- 

c DKRTRESI RRRS NIVPKPED- 

ACCTTGAGAACTAGGGATCTGCCAGAACTAAACAGCATCCTCCCATCTCGATTTTCATTC 

2521 + + + + + + 2580 

T6GAACTCTTGATCCCTAGACGGTCTTGATTTGTCGTAGGAGGGTAGAGCTAAAAGTAAG 

a TLRTRDLPELNSILP SRFSF 

b P*ELGICQN*TASSHLDF HS- 

C LEN*GSARTKQHPPISIFIP- 

CAAAAAGTTGAAACATTAGTCATCACAAATTGCCCCAGAGTTAAGAAACTGCCGTTTCAG 

2581 + + + + + + 2640 

GTTTTTCAACTTTGTAATCAGTAGTGTTTAACGGGGTCTCAATTCTTTGACGGCAAAGTC 

a QKVETLVITNCPRVKK LPFQ 

b KKLKH*SSQIAPELRNCRFR- 

C KS*NISHHKLPQS*ETAVSG- 

GAGAGGAGGACCCAGATGAACTTGCCAACAGTTTATTGTGAGGAGAAATGGTGGAAAGCA 

2641 + + + + + + 2700 

CTCTCCTCCTGGGTCTACTTGAACGGTTGTCAAATAACACTCCTCTTTACCACCTTTCGT 

a ERRTQMNLPTVYCEEKWWKA 

b RGGPR*TCQQFIVRRN GGKH- 

c EEDPDELANSLL*GEMVEST- 

CTGGAAAAAGATCAACCAAACGAAGAGCTTTGTTATTTACCGCGCTTTGTTCCAAATTGA 

2701 + + + + + + 2760 

GACCTTTTTCTAGTTGGTTTGCTTCTCGAAACAATAAATGGCGCGAAACAAGGTTTAACT 

a LEKDQPNEELCYLPRFVPN* 

b WKKI NQ TKSFVIYRALFQID - 

c GKRSTKRRALLFTALCSKLI- 

TATAA6AGCTAAGAGCACTCTGTACAAATATGTCCATTCATAAGATGCAG6AAGCCAG6A 

2761 + + + + + + 2820 

ATATTCTCGATTCTCGTGAGACATGTTTATACAGGTAAGTATTCTACGTCCTTCGGTCCT 

a YKS*EHSVQICPFIR CRKPG 

b IRAKSTLYKyVHS*DAGSQE- 

C *ELRALCTNMSIHKMQEARK- 

AGGTTGTTCCAGTGAAGTCATCAACTTTCCACATAGCCACAAAACTAGAGATTATGTAAT 

2821 + + + + + + 2880 

TCCAACAAGGTCACTTCAGTAGTTGAAAGGTGTATCGGTGTTTTGATCTCTAATACATTA 

Fig. 2G 
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a RL FQ*SHQLST*PQN* RLCN 

b GCSSEVINFPHSHKTRDYVI- 

c VV PV K SSTFHIATKLEI M*S- 

CATAAAAACCAAACTATCCGCGA 

2881 + +— 2903 

GTATTTTTGGTTTGATAGGCGCT 

a HKNQTIR 

b IKTKLSA 

c *KPNYPR- 

ENZYMES THAT DO CUT: 

NONE 

ENZYMES THAT DO NOT CUT: 
Kpnl 



Fig. 2H 
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-146 . . 

ATCGATTGATCTCTGGCTCAGTGCGAGTAGTCCATTTGAGAGCAGTCGTAGCCCCGCGTG - 8 6 



GCGCATCATGGAGCTATTTGGAATTTTCGCAGGGTTATCGATTCGTAGTGGGAACCCATT - 2 6 

1 . . . . 

CATTGTTTGGAACCACCAACGGACGACTTAACAAGCTCCC CGAGGT GCATGATGAAAATT 35 

MetLysIle 



GCTCCAGTTGCCATAAATCACAGCCCGCTCAGCAGGGAGGTCCCGTCACACGCGGCACCC 95 
AlaProValAlalleAsnHisSerProLeuSerArgGluValProSerHisAlcLMaPro 

ACTCAGGCAAAGCAAACCAACCTTCAATCTGAAGCTGGCGATTTAGATGCAAGAAAAAGT 155 
ThrGlnAlaLysGlnThrAsnLeuGlnSerGluAlaGlyAspLeiiAspAlaArgLysSer 

AGCGCTTCAAGCCCGGAAACCCGCGCATTACTCGCTACTAAGACAGTACTCGGGAGACAC 215 
SerAlaSerSerProGluThrArgAlaLeuLeuAlaThrLysThrValLeuGlyArgHis 

AAGATAGAGGTTCCGGCCTTTGGAGGGTGGTTCAAAAAGAAATCATCTAAGCACGAGACG 275 
LysIleGluValProAlaPheGlyGlyTrpPheLysLysLysSerSerLysHisGluThr 

GGCGGTTCAAGTGCCAACGCAGATAGTTCGAGCGTGGCTTCCGATTCCACCGAAAAACCT 335 
GlyGlySerSerAlaAsnAlaAspSerSerSerValAlaSerAspSerThrGluLysPro 

TTGTTCCGTCTCACGCACGTTCCTTACGTATCCCAAGGTAATGAGCGAATGGGATGTTGG 395 
LeuPheArgLeuThrHisValProTyrValSerGlnGlyAsnGluArgMetGlyCysTrp 

TATGCCTGCGCAAGAATGGTTGGCCATTCTGTCGAAGCTGGGCCTCGCCTAGGGCTGCCG 455 
TyrAlaCysAlaArgMetValGlyHisSerValGluAlaGlyProArgLeuGlyLeuPro 

GAGCTCTATGAGGGAAGGGAGGCGCCAGCTGGGCTACAAGATTTTTCAGATGTAGAAAGG 515 
GluLeuTyrGluGlyArgGluAlaProAlaGlyLeuGlnAspPheSerAspValGluArg 

TTTATTCACAATGAAGGATTAACTCGGGTAGACCTTCCAGACAATGAGAGATTTACACAC 575 
PhelleHisAsnGluGlyLeuThrArgValAspLeuProAspAsnGluArgPheThrHis 



Fig. 3A 
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GAAGAGTTGGGTGCACTGTTGTATAAGCACGGGCCGATTATATTTGGGTGGAAAACTCCG 635 
GluGluLeuGlyAlaLeuLeuTyrLysHisGlyProIlellePheGlyTrpLysThrPro 



AATGACAGCTGGCACATGTCGGTCCTCACTGGTGTCGATAAAGAGACGTCGTCCATTACT 695 
AsnAspSerTrpHisMetSerValLeuThrGlyValAspLysGluThrSerSerlleThr 

• ••••• 

TTTCACGATCCCCGACAGGGGCCGGACCTAGCAATGCCGCTCGATTACTTTAATCAGCGA 755 
PheHisAspProArgGlnGlyProAspLeuiAlaMetProLetiAspTyrPheAsnGlnArg 

TTGGCATGGCAGGTTCCACACGCAATGCTCTACCGCTAAGTAGCAGGGTATCTTCACGTG 815 
LeuAlaTrpGlnVal Pr oHi sAl aMe tLeuTy r Ar gEnd 

GCGGCATCATGACAAGCCCATGATGCCGCCAGCAGCTACCTGAATGCCGTCTGGCTTTTT 875 



GGTCCCTATTGTCGTATCCGGAAGATGACGTCAAAGAATCTCGGCAAGAGCTTTCTTGCT 935 
CGACTCCTCAGCTTCCGGATCGATCAGGTCGCTTGCCAGAGCGCGCTTGTCCATGAGCAT 995 
CTGCCACAGCTGCTGGTCGATGGTGTCCTCAGCTAAAGGGATTTTGACGACAACCATGCG 1055 
CAACTGCCCGTTGCGATACGCTCGATCCTGAAGCCCCGGTGTCCATGGCAGCCCCAAGAA 1115 
AAAGACATAGTTCGCCGCTGT6AGGTTGTAGCCTGTGCCGGCGGCCGACCTGGTCCCGAT 1175 
AAACACCCTGCAGTCCGGATCCTGCTGGAAAGCATCAATCGCCTTCTGCCGCTTCTTGGG 1235 
CGAGTCACTGCCCACCAACGTCACGCACCCGACGCCAAGCTTGAGGCAGTGCTCCCGCAA 1295 

• B ■ ■ • 

CGTGGCCACGGATTCCTGATACTCGCAGAAGAGGATCACCTTGTCGTCGAC 1346 



Fig. 3B 
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1 50 
L6pro MSYLREVATA VALLLPFILL NKFWRPNSKD SIVNDDDDST SEVDAISDST 

Nprot M 

PrfP 

rps2 



51 



100 



L6pro NPSGSFPSVE YEVFLSFRGP DTREQFTDFL YQSLRRYKIM TFRDDDELLK 
Nprot ASSSsssRws yd |vflsfrg| e DTRKTFTSHL YEVLNDKGIK TFQDDKRLEY 

PrfP LRSKLDLIID LKHQIESVKE 

rps2 MDFISSLIV6 CAQVLCESMN MAERRGHKTD LRQAITDLET 



101 150 
L6pro GKEIGPNLLR AIDQSKIYVP IISSGYADSK WCLMELAEIV RRQEEDPRRI 
Nprot GATIP6ELCK AIEESQFAIV VFSENYATSR WCLNELVKIM ECK.TRFKQT 

PrfP GLLCLRSFID HFSESYDEHD EA CGLIARVSVM AYKAE 

rps2 AIGDLKAIRD DLTLRIQQDG LEGRSCSNRA REWLSAVQVT ETKTA 

15 1 7 200 

L6pro ILpIFYMVDP SpVRHQTGCY KKAFRKHANK F..DGQTIQN WKDALKKVGD 
Nprot Vi piFYPyPP Sl HVRNQKESF AKAFEEHETK YKDDVEGIQR WRIALNEAAN 

PrfP . YVIDS CLAYSHPLWY KVLW IS EVLENIKLV 

rps2 LLLVR FRRREQRTRM RRRY LSCFGCAD YKLCKKVSAI 



201 



8 250 



L6pro LKGWHIGKND KQGAIADKVS ADIWSHISKE NLILE...TD ELlVGIDDHlIT 

Nprot LKGSCDNRDK TDADCIRQIV DQISSKLCKI SLSY LQ Nl|vGIDTgLE 

PrfP NKVVGETCER RNIEVTVHEV AKTTTYVAPS FSAYTQRANE EME6FQDTID 
rps2 LKSIGELRER SEAIKTDGGS IQVTCREIPI KSVVG NTTMM 



251 



1 



L6pro AVLEKLSLDS ENVTMVGLYG MGGIGKTTTA 

Nprot KIESLLEIGI N6VRIMGIWG MGGVGKTTIA 

PrfP ELKDKLLGGS PELDVISIVG MPGLGKTTLA 

rps2 EQVLEFLSEE EERGIIGVYS PGGVGKTTLM 



P-IOOp 300 

KAVYNKI SSC.FDCC 

RAIFDTLLGR MDSSYQFDGA 
KKIYNDPEVT . . SRFDVHAQ 
QSINNELITK ..G HQY 



301 350 
L6pro CFIDNIRETQ EKDGWVLQK KLVSEILRID . . SGSVGFNN DSGGRKTIKE 

Nprot CFLKDIKE.. NKRGMHSLQN ALLSELLR EKANYNN EEDGKHQMAS 

PrfP CWTQLYSWR EL.LLTILND VLEP...S DRNEKED GE . lADELRR 

rps2 DVLIWVQMSR EF.GECTIQQ AVGA. ..RLG . .LSWDEKET GENRALKIYR 



351 



400 

SRFIIT SRISMRVLGTL 



L6pro RVSRFKILVV LDDVDEKFKF EDMLGSPKDF ISQ. 

Nprot RLRSKKVLIV LDDIDNKDHY LEYLAGDLDW FGNGSRIIIT TRIdKHLI . . 

PrfP FLLTKRFLIL IDDVWDYKVW DNLCMCFSD. VSNr |sRIIL T TRLNDVAEYV 

rps2 ALRQKRFLLL LDDVWEEIDL EKTGVPRPD. RENKCKVMF|T Tr|sIALCNNM 



Fig. 5A-1 
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L6pro 
Nprot 
PrfP 
rps2 



L6pro 
Nprot 
PrfP 
rps2 



L6pro 
Nprot 
PrfP 
rps2 



L6pro 
Nprot 
PrfP 
rps2 



401 450 

NEN.QCKLYE VGSMSKPRSL ELFSKHAFKK NT PPSY YETLANDWD 

.EK.NDIIYE VTALPDHESI QLFKQHAFGK EV PNEN FEKLSLEWN 

.KC.ESDPHH LRLFRDDESW TLLQKEVFQG E SCPPE LEDVGFEISK 

.6A.EYK.LR VEFLEKKHAW ELFCSKVWRK DLLESSSIRR LAEI...IVS 



451 



^ 500 

TTAIGLPLTLK VlIGSLLFKQE lAV. .WEDTL EQL RRT LNLDEVYDRL 

YAKGLPLALK VWGSLLHNLR LTE..WKSAI EHM KNN . SYSGIIDNV 

SCRGLPLSW LVA6VLKQKK KTLDSWKWE QSLS..SQRI GSLEESISII 
KCG6LPLALI TLG6AMAH.R ETEEEWIHAS EVLTRFPAEM K6MNYVFALL 



501 5 9 550 

KISYDALtNPE .AKEllFLDIA CFIFIGQ. .NK EEPYYMWTDC NFYPASNIIF 
KISYDGLEPK . QQEM IFLDIA CFj LRGE ♦ . EK DYILQILESC HIGAEYGLRI 
GFSYKNL.PH YLKPCFLYFG GFLQGKDIHD SKMTKLWVAE EFVQANN. . . 
KFSYDNLESD LLRSCFLYCA LFPEEHSIEI EQLVEYWVGE GFLTSSHGVN 



551 



10 



600 



LIQRGMIQVG DD DEFKMHDQLR DMGREIVRRE DVLPWKRSRI 

LIDKSLVFIS EY NQv dMHDLIQ DMGK YIVNFQ KD.PGERSRL 

EK GQEDTRTRF. .LGRSYW 

TIYKGYFLIG DLKAACLLET GDEKTQVKMH NWRSFALWM ASEQGTYKEL 



601 650 
L6pro WSAEEGIDLL LNKKGSSKVK AISI.PWGVK YEFK.SECFL NLSELRYLHA 
Nprot WLAKEVEEVM SNNTGTMAME AIWVSSYSST LRFS.NQAVK NMKRLRVFNM 

PrfP 

rps2 ILVEPSMGHT EAPKAENWRQ ALVISLLDNR IQTL.PEKLI CPKLTTLMLQ 

651 700 
L6pro REAMLT6DFN NLLPNLKWLE LPFYKH6EDD PPLTNYTMKN LII.VILEHS 
Nprot GRSSTHYAID YLPNNLRCFV CTNYPW. . .E SFPSTFELKM LVH.LQLRH. 

PrfP 

rps2 QNSSLKKIPT GFFMHMPVLR VLDLSF TSITEIPLSI KYL.VELYHL 

701 750 

L6pro HITADDWGGW RHMMKMAERL KWRLASNYS LYGRRVR 

Nprot NSL RHLWTETKHL PSL RRID 

PrfP 

rps2 SMSGTKISVL PQELGNLRKL KHLDLQRTQF LQTIPRDAIC WLSKLEVLNL 



L6pro 
Nprot 
PrfP 
rps2 



751 800 
.LSD.CWRFP KSIEVLSMTA lEMDEVDIGE LKKLKTLVLK FCPIQKISGG 
.LSW.SKRLT RTPDFTGMPN LEY. .VNLYQ CSNLEEVHHS LGCCSKVIGL 

YYSY.AGWEL QSFGEDEAEE LGFADLEYLE NLTTLGITVL SLETLKTLFE 
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801 850 
L6pro TF6MLK6LRE L.CLEFNWGT NLREWADIG QLSSLKVLKT TGAKEVEINE 
Nprot YLNDCKSLKR F PCVNVESLE 

PrfP 

rps2 FGALHKHIQH L.HVEECNEL LYFNLPSLTN HGRNLRRLSI KSCHDLEYLV 

851 900 

L6pro FPLGLK ELSTSSR IPNLSQLLDL EVLKVYDCKD GFDMPPASPS 

Nprot Y.LGLR SCDSLEK LPEIYGRMKP EI QIHMQGSGIR 

PrfP 

rps2 TPADFENDWL PSLEVLTLHS LHNLTRVWGN SVSQDCLRNI RCINISHCNK 

901 950 
L6pro EDESSVWWKV SKLKSLQLEK TRINVNWDD ASSGGHLPRY LLPTSLTYLK 
Nprot ELPSSIFQYK THVTKLLL.. .WNMKNLVAL PSSICRL KSLVSLS 

PrfP 

rps2 LKNVSWVQKL PKLEVIELFD CREIEELISE HESPSVEDPT LFP.SLKTLR 

951 1000 
L6pro lYQCTEPTWL P.GIENLENL TSLEVNDIFQ TLGGDLDGLQ GLRSLEILRI 
Nprot VSGCSKLESL PEEIGDLDNL RVFDASDTL ILRP 

PrfP 

rps2 TRDLPELNSI LPSRPSFQKV ETLVITNCPR VKKLPFQERR TQMNLPTVYC 

1001 1050 



L6pro RKVNGLARIK GLKDLLCSST CKLRKFYITE CPDLIELLPC ELGGQTVWP 
Nprot P SSI IRLNKLIILM FR6FKDGVHF EFPPVAEGLH 

PrfP 

rps2 EEKWWKALEK DQPNEELCYL PRFVPN 

1051 1100 
L6pro SMAELTIRDC PRLEVGPMIR SLPKFPMLKK LDLAVANITK EEDLDAIGSL 
Nprot SLEYLNL.SY CNLID66LPE EIGSLSSLKK LDLSRNNF.. EHLPSSIAQL 

PrfP 

rps2 



1101 1150 
L6pro EELVSLELEL DDTSSGIERI VSSSKLQKLT TLWKVPSLR EIEGLEELKS 
Nprot GALQSLDLK DCQRLTQLP ELPPELNELH .VDCHMALKF 

PrfP 

rps2 

1151 1200 

L6pro LQDLYLEGCT SLGRLPLEKL KE LD IGGCPDLTEL VQTWAVPSL 

Nprot IHDL.VTKRK KLHRVKLDDA HNDTMYNLFA YTMFQNISSM RHDISASDSL 

PrfP 

rps2 



Fig. 5A-3 
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1201 1250 
L6pro RGLTIRDCPR LEVGPMIQSL PKFPMLNELT LSMVNITKED ELEVLGSLEE 
Nprot .SLTV FTGQPYPEKI PSWFHHQGWD .SSVSVNLPE NWYIPDKFLG 

Prf P 

rps2 

1251 1300 



L6pro LD.SLELTLD DTCSSIERIS FLSKLQKLTT LIVEVPSLRE lEGLAELKSL 
Nprot FAVCYSRSLI DTTAHLIPVC .DDKMSRMTQ KLALSECDTE SSNYSEWD.I 

PrfP 

rps2 

1301 1350 

L6pro RILYL EGCTSLERL WPDQQQLGSL KNLNVLDIQG 

Nprot HFFFVPPAGL WDTSKANGKT PNDYGIIRLS FSGEEKMYGL RLLYKEGPEV 

PrfP 

rps2 

1351 1387 

L6pro CKSLSVDHLS ALKTTLPPRA RITWPDQPYR 

Nprot NALLQMRENS NEPTEHSTGI RRTQYNNRTS FYELIN6 

PrfP 

rps2 



Fig. 5A-4 
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N 2 
L6 51 
N 52 
L6 101 
N 101 
L6 151 
N 151 
L6 199 
N 200 
L6 249 
N 250 
L6 293 
N 295 
L6 343 
N 341 
L6 392 
N 391 
L6 442 
N 440 
L6 492 
N 490 
L6 542 



ASSSSSSRWSYDVFLSFRGEDTRKTFTSHLYEVLNDKGIKTFQDDKRLEY 51 

.•hi .hllllllMII. II- l|: |. |.||.||. I 

NPSGSFPSVEYEjVFLSFRGjPDTREQFTDFLYQSLRRYKIMTFRDDDELLK 100 

• • • • • 

GATIPGELCKAIEESQFAIVVFSENYATSRWCLNELVKIMECK.TRFKQT 100 
I • I • • :| • I h : I . : : . : : I . . I I . I : I I I | | . . | : . . : . . 

6KEI6PNLLRAIDQSKIYVPIISSGYADSKWCLMELAE1VRRQEEDPRR1 150 
7 * • • • • 

VIlPIFYDVDPSlHVRNQKESFAKAFEEHETKYKDDVEGIQRWRIALNEAAN 150 
::|||| ||||.||:|.::: ||| .|..|: |.:.|].|: ||...:: 

H jPIFYMVDPSP VRHQTGCYKKAFRKHANKF . . DGQTIQNWKDALKKVGD 198 



LKGSCDNRDKTDADCIRQIVDQISSKLCKISLSY.LQNIVGIDTHLEKIE 199 
III * • • • • • I I • • I I •••Mill- 

III* ••••••••• •• • • ••• |*|«« • 

LK6WH16KNDKQGAIADKVSAD1WSHISKENL1LETDELVGIDDHITAVL 248 



SLLEIGINGVRIMGIWGMGGVGKTTIARAIFDTLLGRMDSSYQFDGACFL 249 

• I ♦ • ♦ • • I • • I • • II II * II II • I • I • J J • • I h 1 1 • • 1 1 • 

EKLSLDSENVTMVGLy|gMGGIGKTTTAKAVYNKI SSC . FDCCCFI 2 92 

• . ■ • • • • 

KDIKE. .NKRGMHSLQNALLSELLR. . . EKANYNNEEDGKHQMASRLRSK 294 

.:|:| :1 |: 11- hlhll : . • . : | | : • : | : . : .|:.. 
DNIRETQEKDGVWLQKKLVSEILRIDSGSVGFNNDSGGRKTIKERVSRF 342 

• • • • • 

KVLIVLDDIDNKDHYLEYLAGDLDWFGNGSRIIITTRDKHLI EKND 340 

h|:||||:|:| : j::- |:::. IhUM-.::: N- 

KILWLDDVDEKFKFEDMLGSPKDFISQ. SRFIITSRSMRVLGTLNENQC 391 

• • • • • 

IIYEVTALPDHESIQLFKQHAFGKEVPNENFEKLSLEVVNYAKGLPLALK 3 90 

:|||..|... |::||..||| h-l • H-l- :||: • IIIMI 
KLYEVGSMSKPRSLELFSKHAFKKNTPPSYYETLANDWDTTAGLPLTLK 441 

VWGSLLHNLRLTEWKSAIEHMKNN. SYSGIIDNVKISYDGLEPKQQEmIfL 439 



VIGSLLFKQEIAVWEDTLEQLRRTLNLDEVYDRLKISYDALNPEAKEIFL 491 
9 

^ • • • * 



DIACFLRGEEKDYILQILESCHIGAEYGLRILIDKSLVFISEYNQVQMHD 489 
DIAcH fIGQNKEEPYYMWTDCNFYPASNIIFLIQRCMIQVGDDDEFKMHD 541 



JOl 



LIQDMGKYIVNFQKD . PGERSRLWLAKEVEEVMSNNTGTMAMEAIWVSSY 538 

••III- II- • • I • II I • I I • I • I • • I • —II-: • : 

QLRDMGREIVRREDVLPWKRSRIWSAEEGIDLLLNKKGSSKVKAISI .PW 590 
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N 539 SSTLRFSNQAVKNMKRLRVFNMGRSSTHYAIDYLPNNLRCFVCTNYPW. . 586 

I • I • I I • • • • I I I • • I 

• I** I ** * I II* * * * 

L6 591 6VKYEFKSECFLNLSELRYLHAREAMLTGDFNNLLPNLKWLELPFYKHGE 640 

• • • • • 

N 587 .ESFPSTFELKMLVHLQLRH NSLRHLWTETKHLPSL 621 

. . I I . . I I ••11. • I . 

L6 641 DDPPLTNYTMKNLIIVILEHSHITADDWGGWRHMMKMAERLKWRLASNY 690 

• • ■ • • • 

N 622 RRIDLSWSKRLTRTPDFTGMPNLEY. .VNLYQCSNLEEVHHSLGCC 665 

I I : II : . I : . : . : . : j . . : | | : : : . . | . . : . : . . . 
L6 691 SLYGRRVRLSDCWRFPKSIEVLSMTAIEMDEVDIGELKKLKTLVLKFCPI 740 

N 666 SKVIGLYLNDCKSLKRFPCVNVESLEYLGLRSCDSLEKLPEIYGRMKP. ! 713 

I* * • •• • •• !• • 

• *!* * • •••••••| •••• • •••• 

L6 741 QKISGGTFGMLKGLREL CLEFNWGTNLREVVADIGQLSSLK 781 

• • • • • 

N 714 EIQIHMQGSGIRELP.SSIFQYKTHVTKLLLWNM. . . .KNLV 750 

• • • •• • ••• • 

• • • • ** !* I ** ••• •! •••• ••• 

L6 7 82 VLKTTGAKEVEINEFPLGLKELSTSSRIPNLSQLLDLEVLKVYDCKDGFD 831 

• . • • 11 • • • 



N 751 ALPSSICRLKSLVSLSVSGCSKLESLPEEIGDLDNLRVFDASDTLILRP. 799 



L6 832 MPPASPSEDESSVWWKV. . . 



SKLKSLQLEKTRINVNWDDASSGGHLPRY 878 



N 800 PSSIIRLNKLIILMFRGFKDGVHFEFPPVAE 830 



L6 879 LLPTSLTYLKIYQCTEPTWLPGIENLENLTSLEVNDIFQTLGGDLDGL.Q 927 

1 2 • • • • * 

831 iGLHSLEYLjNLSYCNLID. . GGLPEEI . 6SLSSLKKLDL . .SRNNFEHLPS 875 

IhllM.:. .1 :. II.: : :| r-hh : :| Ih 

GLRSLEILRIRKVNGLARIKGLKDLLCSSTCKLRKFYITECPDLIELLPC 977 



N 876 SIA QLGALQSLDLKDCQRLTQLPELPPELNELHVDCHMALKFIHYL 921 

• • • • l*« ** • • 

• •• ••••••|**« ^ ••••• ** •••• ••• 

L6 978 ELGGQTVWPSMAELTIRDCPRL.EVGPMIRSLPKFPM LKKLDLA 1021 

• • • • • 

N 922 VTKRKKLHRVKLDDAHNDTMYNLFAYTMFQNISSMRHDISASDSLSLTVF 971 

|«» »! • •• •* • •• I** • * \ j |j * 

L6 1022 VANITKEEDLDAIGSLEELV. .SLELELDDTSSGIERIVSSSKLQKLTTL 1069 
N 972 TGQPYPEKIPSWFHHQGWDSSVSVN LPENWYIPDKFLGFAVCY 1014 

|«| * • •• • • •]•••• 

• | * • • ••• I** * •!• ••••• 

L6 1070 VV KVPSLREIE6LEELKSLQDLYLEGCTSLGRLPLEKLKELDIGG 1114 
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• • • • • 

N 1015 SRSLIDTTAHLIPVCDDK MSRMTQKLA LSECDTES 1049 

• • ••• • •• • 

• ••• •• • • I* • • 

L6 1115 CPDLTELVQTWAVPSLRGLTIRDCPRLEVGPMIQSLPKFPMLNELTLSM 1164 

• • • • • 

N 1050 SNYSEWDIHPFPVPFAGLWDTSKANGKTPNDYGIIRLSFSGEEKMYGLRL 1099 

|... I . . . |.: :| :|: .| : 

L6 1165 VNITKEDELEVLGSLEELDSLELTLDDTCSSIERISF.LSKLQKLTTLIV 1213 

• • • • 

N 1100 LYKEGPEVNALLQMRENSNEPTEHSTGIRRTQYNNRTSFYELIN 1143 

I •••[••• • •• • 

• •••• • I • •• r • • ml 

L6 1214 EVPSLREIEGLAELKSLRILYLEGCTSLER.LWPDQQQLGSLKN 1256 
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PXXaXX LXXLXXLXaXXXX aXXa 



5 05 PKAENW RQALVISLLD NR IQTL 

52 7 PEKLIC PK LTTLMLQQNSSLKKI. 

550 PTGFFMHMPVLRVLDLSFTS ITEI 

5 74 PLSIKY LVELYHLSMSGTK ISVL 

5 97 PQELGN LRKLKHLDLQRTQFLQTI 
621 PRDAICWLSKLEVLNLYYSYAGWEL 
658 FADLEY LENLTTLGITVLS LETL 

6 83 LFEFGALHKHIQHLHVEECNELLYF 
710 P SLTNHGRNLRRLSIKSCHDLEYL 
73 6 PADFENDWLPSLEVLTLHSLHNLTRV 
765 SVSQDC LRNIRCINISHCNKLKNV 
795 PK LEV lELFDCREIEELISEHES 
82 3 PT LFPSLKTLRTRDLPELNSI L 
84 5 PSRFS FQKVETLVITNCPRVKKL 
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Fig. 7 
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Fig. 9 
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I 10 I 20 I 30 I 40 I 50 I 60 
1 aagctttaca gattggatga tctcttaatg catgctgaag tgactgcaaa aaggttagca 60 
61 atattcagtg gttctcgtta tgaatatttc atgaacggaa gcagcactga gaaaatgagg 120 
121 cccttgttat ctgattttct gcaagagatt gagtctgtca aggtagagtt cagaaatgtt 180 
181 tgcttgcaag ttctggatat atcacctttt tccctgacag atggagaagg ccttgttaat 240 
241 ttcttattaa aaaaccaggc caaggtgccg aatgatgatg ctgtttcttc tgatggaagt 300 
301 ttagaggatg caagcagcac tgagaaaatg ggacttccat ctgattttct ccgagagatt 360 
361 gagtctgttg agataaagga ggccagaaaa ttatatgatc aagttttgga tgcaacacat 420 
421 tgtgagacga gtaagcacga tggaaaaagc tttatcaaca ttatgttaac ccaacaggac 480 
481 aaggtgctgg actatgatgc tggttcagtg tcttatcttc ttaaccaaat ctcagtagtt 540 
541 aaagacaaaa tattgcacat tggctcttta cttgtagata ttgtacagta ccggaatatg 600 
601 catatagaac ttacagatct cgctgaacgt gttcaagata aaaactacat tcgtttcttc 660 
661 tctgtcaagg gttatattcc tgcttggtat tacacactat atctctctga tgtcaagcaa 720 
721 ttgcttaagt ttgttgaggc agaggtaaag attatttgtc tgaaagtacc agattcttca 780 
781 agttatagct tccctaagac aaatggatta ggatatctca attgcttttt aggcaaattg 840 
841 gaggagcttt tacgttctaa gctcgatttg ataatcgact taaaacatca gattgaatca 900 
901 gtcaaggagg gcttattgtg cctaagatca ttcattgatc atttttcaga aagctatgtt 960 
961 gagcatgatg aagcttgtgg tcttatagca agagtttctg taatggcata caaggctgag 1020 
1021 tatgtcattg actcatgctt ggcctattct catccactct ggtacaaagt tctttggatt 1080 
1081 tctgaagttc ttgagaatat taagcttgta aataaagttg ttggggagac atgtgaaaga 1140 
1141 aggaacactg aagttactgt gcatgaagtt gcaaagacta ccactaatgt agcaccatct 1200 
1201 ttttcagctt atactcaaag agcaaacgaa gaaatggagg gttttcagga tacaatagat 1260 
1261 gaattaaagg ataaactact tggaggatca cctgagcttg atgtcatctc aatcgttggc 1320 
1321 atgccaggat tgggcaagac tacactagca aagaagattt acaatgatcc agaagtcacc 1380 
1381 tctcgcttcg atgtccatgc tcaatgtgtt gtgactcaat tatattcatg gagagagttg 1440 
1441 ttgctcacca ttttgaatga tgtgcttgag ccttctgatc gcaatgaaaa agaagatgga 1500 
1501 gaaatagctg atgatctacg ccgatttttg ttgaccaaga gattcttgat tctcattgat 1560 
1561 gatgtgtggg actataaagt gtgggacaat ctatgtatgt gcttcagtga tgtttcaaat 1620 
1621 aggagtagaa ttatcctaac aacccgcttg aatgatgtcg ccgaatatgt caaatgtgaa 1680 
1681 agtgatcccc atcatcttcg tttattcaga gatgacgaga gttggacatt attacagaaa 1740 
1741 gaagtctttc aaggagagag ctgtccacct gaacttgaag atgtgggatt tgaaatatca 1800 
1801 aaaagttgta gagggttgcc tctctcagtt gtgttagtag ctggtgttct gaaacagaaa 1860 
1861 aagaagacac tagattcatg gaaagtagta gaacaaagtc taagttccca gaggattggc 1920 
1921 agcttggaag agagcatatc tataattgga ttcagttaca agaatttacc acactatctt 1980 
1981 aagccttgtt ttctctattt tggaggattt ttgcagggaa aggatattca tgactcaaaa 2040 
2041 atgaccaagt tgtgggtagc tgaagagttt gtacaagcaa acaacgaaaa aggacaagaa 2100 
2101 gatacccgca caaggtttct tggacgatct tattggtagg aatctggtga tggccatgga 2160 
2161 gaagagacct aatgccaagg tgaaaacgtg ccgcattcat gatttgttgc ataaattctg 2220 
2221 catggaaaag gccaaacaag aggatttcct tctccagatc aataggtaaa aaaaactgta 2280 
2281 ttaattttac attacaaaaa aaaagaactg tattaatttt actgtattat gtttatgcca 2340 
2341 actctcattt ccatgtgttc tcttttattc aattcagtgg agaaggtgta tttcctgaac 2400 
2401 gattggaaga ataccgattg ttcgttcatt cttaccaaga tgaaattgat ctgtggcgcc 2460 
2461 catctcgctc taatgtccgc tctttactat tcaatgcaat tgatccagat aacttgttat 2520 
2521 ggccgcgtga tatctccttc atttttgaga gcttcaagct tgttaaagtg ttggatttgg 2580 



Fig. 1 2A 



ATTORNEY DOCKET NO.: 00786/254005 

Applicants: Frederick M. Ausubel et al. 
U.S. Serial No.: 10/613,765 
Filed: July 2, 2003 

Title: RPS GENE FAMILY. PRIMERS. PROBES, 
AND DETECTION METHODS 

Page 30 of 30 ) 



2581 aatcattcaa cattggtggt acttttccca 
2641 actttgcggc ccaaactgat gcaaattcaa 
2701 ttgagacttt tgtcgtaaga ggattgggag 
2761 agatggtgaa attgaggcat atacatgtaa 
2821 acatggatgt tttaactggt aactcacaat 
2881 gtctctttta tggtaaagac gcagagaaga 
2941 tgagttgcat attttcaggg acatttggtt 
3001 gttttcccag attagatttt ctaagtcacc 
3061 atccagccaa acttcctcac aagttcaatt 
3121 caaagttccg tctaccttgg acccaaattt 
3181 ttcttaagtt attgctcaga gcctttgaag 
3241 tcctagaact caaatactta aaactggaca 
3301 atgatgcttt tcctaagctt gaacatttgg 
3361 tcccttctcg ttttgaagat gctgtttgte 
3421 ggaatgttgc caattcagcc caagatattc 
3481 attcattcac agttactata cagcctccag 
3541 agcaaaggtt tgttcttgct gtgttcatcc 
3601 tacaccagaa catgtttatt ttgctagtat 
3661 tatttctgct acagtcttaa cttttcttgg 
3721 ttcatgtaat ttttaattca ctgtttcatt 
3781 atattgttat ccctagccaa atttattatg 
3841 ttttgtgaaa tgtttttgaa ttttttgcta 
3901 tgtcagctaa actctttgtc accagaagtg 
3961 gttctgtaga attttatgct tttgcagaat 
4021 cagagatagc agaagctaaa gttcaaggca 
4081 ttatgttgaa ttcttgaaaa gaagaagaat 
4141 tttttcttct tttagatgtt atttcttcat 
4201 aataaaaccg gagagaaaaa taagatcttt 
4261 taaatccata tggtacaact gtttgacaaa 
4321 taaaacagga ttgagaaaaa aatccttgca 
4381 tgtgtcaaag ttcccctctt taagtggaac 
4441 gacataccaa taccagctga ctgtctcatc 
4501 attagattag ttactaatga actggtaaat 
4561 ggtagacatg tatatatgaa gatacacgcg 
4621 ttgatttttt ttcttcacag agtatatatg 
4681 ttaactatta ccgtggatga aacaagcatg 
4741 atgtaaaaaa tggaggttct acgagcggta 
4801 ctgagacttg aaccatccat gtccaaggca 
4861 gagcctgcac taattaatct ccctgtatga 
4921 tgaaccaaaa gttacctttt ttttttcttc 
4981 ttagttgtaa attgtaatgg tgaagtgttt 
5041 ggttgtgtta tctttacaaa ccggaatcat 
5101 tttcggttta ttcattactc atttcagtaa 
1 10 I 20 I 30 



ttgaaacaca atatctaatt cagatgaagt 2640 
ttccttcatc tatagctaag cttgaaaatc 2700 
gagagatgat attaccttgt tcacttctga 2760 
atgatcgggt ttcttttggt ttgcgtgaga 2820 
aacctaattt ggaaaccttt tctactccgc 2880 
ttttgaggaa gatgccaaaa ttgagaaaat 2940 
attcaaggaa attgaagggt aggtgtgttc 3000 
ttgagtccct caagctggtt tcgaacagct 3060 
tcccctcgca actaagggaa ctgactttat 3120 
cgatcattgc agaactgccc aacttggtga 3180 
gggatcactg ggaagtgaaa gattcagagt 3240 
acctcaaagt tgtacaatgg tccatctctg 3300 
ttttaacgaa atgtaagcat cttgagaaaa 3360 
taaatagagt tgaggtgaac tggtgcaact 3420 
aaactatgca acatgaagtt atagcaaatg 3480 
attggtctaa agaacagccc cttgactctt 3540 
aagtgcattt aacatttatt cattttgttt 3600 
tacttgatac attaaaagaa atcgaactca 3660 
gcttacttga ggtctagatt agatcaatgg 3720 
caactgtctt atgatagttg tgaaatgaca 3780 
ttcaaatgaa aactgatgtc acaactactt 3840 
taaaattgac gaattgacag cttctatatt 3900 
tatttagaat tactgtggtt ttatgaaaga 3960 
atagtttaaa acaacaacac ttctctgttt 4020 
ttttgtttat ttctagaaca agtggagttc 4080 
caggagcagg taaagttatc tctttttatg 4140 
cttgaacgtg aacaccgctg aaagcatttt 4200 
ttatataaag cattatcatg taaatatgcc 4260 
atgatagaga ggggagtttt atagtataag 4320 
cgattttcaa tttctggcca catcacaatg 4380 
aagcaatcag aaaagctcat tcttatcggt 4440 
ttggttaact tagccttgct tacttagact 4500 
tggaaccaaa tgtagttagc ttgatgagct 4560 
taactttagt cgatggttaa tttttcattt 4620 
aacttggcct aaaagttttg cttcactaat 4680 
gcaacatttt caacaactat cactcaagca 4740 
catgtaagag ttttgtgcac acaagaggtt 4800 
gttgagatgc tagtaaagaa agaagaagat 4860 
atgagagaat gagaaaaaga tggagcttca 4920 
ttaatggcat tactttgaag cacatgtttg 4980 
gtaaatatag ggagtgatat ttgaaagaat 5040 
ttctgtataa ttttcttctg taatttttgg 5100 
gctt 5134 
I 40 I 50 I 60 



Fig. 12B 



